Name | Version | Summary | date |
mineru |
2.1.4 |
A practical tool for converting PDF to Markdown |
2025-07-23 07:53:18 |
linkrot |
5.2.2 |
Extract metadata and URLs from PDF files |
2025-07-22 18:53:37 |
BrazilFiscalReport |
0.5.33 |
Python library for generating Brazilian auxiliary fiscal documents in PDF from XML documents. |
2025-07-22 17:36:15 |
docling |
2.42.1 |
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. |
2025-07-22 16:47:03 |
asposepdfcloud |
25.7.0 |
Aspose.PDF Cloud |
2025-07-22 16:44:38 |
ocr-document-converter |
3.1.0 |
Enterprise-grade OCR and document conversion tool with dual OCR engines |
2025-07-22 15:19:03 |
markitdown-pdf-separators |
0.4.1 |
MarkItDown with PDF page separators - convert PDFs to Markdown with page boundary markers |
2025-07-22 14:44:59 |
product-connections-manager |
1.0.1 |
A comprehensive platform for managing Product Connections operations with automated EDR printing |
2025-07-22 09:43:37 |
exparso |
0.0.3 |
Analyzing and parsing documents |
2025-07-22 09:43:16 |
simulchip |
0.2.1 |
Compare NetrunnerDB decklists against local card collection and generate PDF proxies |
2025-07-22 05:38:57 |
codex |
1.8.4 |
A comic archive web server. |
2025-07-21 23:40:03 |
comicbox |
2.0.1 |
Comic book archive multi format metadata read/write/transform tool and image extractor. |
2025-07-21 16:13:39 |
llm-data-converter |
2.1.6 |
Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract |
2025-07-21 12:19:10 |
txt2ebook |
0.1.149 |
CLI tool to convert txt file to ebook format |
2025-07-20 08:44:12 |
wdoc |
3.3.0 |
A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, oe any combination!) |
2025-07-19 12:44:46 |
gs-pdf-compress |
0.2.1 |
Compress PDF files with Ghostscript |
2025-07-18 23:29:09 |
pdfix-sdk |
8.7.2 |
PDFix SDK - Automated PDF Remediation, Data Extraction, HTML Conversion |
2025-07-18 06:51:27 |
pdf-tools-mcp |
0.1.3 |
A FastMCP-based PDF reading and manipulation tool server |
2025-07-18 03:32:46 |
o7pdf |
1.0.1 |
PDf Reports |
2025-07-17 13:24:06 |
txtify |
0.1.2 |
A versatile Python tool to convert documents (PPTX, DOCX, PDF, XLSX) to plain text, ideal for providing context to AI code assistants like GitHub Copilot and Amazon CodeWhisperer. |
2025-07-17 03:24:47 |